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COR RF JT EEN ' NG A " RAYS ° F NUCLE °-"^ TO DETERMINE 
CORRESPONDENCE WITH BOTH SEQUENCE AND PHYSICAL PROPERTIES 

OF A PROBE 

STATEMENT AS TO FEDERALLY SPONSORED RESEARCH 

msinvention was made wth Govemmem suppoft ^ 

grant number HHZ419 flhd ^ Howgrd Hughes Medjca[ - 
Government may have certain rights in this invention. 

Introduction 

Background 

A large n^,, „, „ ^ 

<*« «« pfcn,; « mesl8nge , RNA$ 

d^Mopment of ne»el therapeutic and dkgnMa. 

'IT. JT" "« hM — 

toWed 8 ^P*M are hybridized to ttiearays, the abtindance of specific 

«»r»| tl ,a,«»pos» 1 o„ cfl heco m p Km e™a, prob8 . while ^m^^Ln 
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also be wed for rapid <, eM o„ of s „ uence 

M* of toe tderapeufe ha(e teen „ 
MM« , ave ^ ^ ^ ^ ^ » 

f ' hU ™ n ^ — -Mr. 

ON*-, or agents « WM t0 sur(Ke 

*a*n of corona, merawfa are atso directed at eeilWaoe m.^ „* 
as receptors, channels and transporters. 

Secreted proteins are also of particular utility in the development of clinical 
10 Aagnostictests. .dentrncation of secreted proteins spedfic to tumors wouKallow the 
development of non-invasive blood based assays along the lines of the PSA screen 
for prostate cancer, that could be used to screen h«h risk populations. Early detection 
of the d,sease snou.d allow early treatment leading to better survival statistics 
Furthermore, recent advances in monodona. antibody therapy call for the identification 
of new tumor-specific membrane associated markers that could serve as targets for 
monodona. antibody therapies. AJso, antibodies against secreted mediators of 
■nflammation or other pathology processes, e.g. cytokines, chemokines, etc., could 
neutralize these offending agents. 

,de ^°"°'proteinshaving^ 
o other fields of biochemical study, include the possible identification of novel 
mo.ecu.es fcvohved in inflammation or Uta, of new receptors and signaling 
molecules important in ceHular interactions during developments processes The 
partial sequences of the tens of thousands of different cDNAs that are currently 
ava,lab«e in public databases are certain to indude many gene products wfth various 
. Phys«a. attributes. However, these cannot always be identified by their sequences 
alone. 

For example, the sequence of transmembrane regions and signal sequences 
are heipful, but not decisive in identifying proteins that are membrane bound or 
secreted. However, the information in EST databases is typica.lv partial sequences 
correspond** to the 3" region of an mRNA, which may not indude the sequences of 
transmembrane regions or signal peptides that could be recognized using sequence 



10 



WO 00/03037 

PCT/US99/I5355 

based rules. Addrt.ona.ly. even when the full-length primary sequence is Known the 
sequence^rediction methods are very unreliable in predicting whether a given protein 
is secreted or membrane bound 

A high throughout method for determining which gene products faO into defined 
funcbonal classes based on properties of the pro important class would therefore be 
of great value. 



Relevant literature 

The complete genome sequence of a number of organisms may be found at the 
National Center for Biotechnology Information, http:/A^.ncbl.n,m.nih.gov/Entrez/ 
Genome/org.htm,. The avai.abi.ity of sequences of genes of the human genome is 
d-scussed in Schuler (1996) Sdence 274:540. The complete sequence of the genome 
of S. cerevisiae is available at several Internet web sites, and is discussed in Goffeau 
et al. (1 996) Science 274:546. 

A number of methods are available for creating microarrays of biological 
samples, such as arrays of DMA samples to be used in DNA hybridization assays 
Exemplary are PCT Application Serial No. WO95735505, published December 28 
1995; U.S. patent no. 5,445.934. issued August 29. 1995; and Drmanac et al.. Science 
260:1649-1652. Yershovefai (1996) Genetics. 93:4913^918 describe an alternative 
construction of an polynucleotide array. The construction and use of polynucleotide 
arrays is reviewed by Ramsay (1998) supra. 

Methods of using high density polynucleotide arrays are known in the art. For 
example. Miiosavljevic et al. (1996) Gerties 37:77-86 describe DNA sequence 
recognition by hybridization to short oligomers. The use of arrays for identification of 
„ 25 unknown mutations is proposed by Ginot (1997) Human Mutatis int.™ 

Quantitative monitoring of gene expression patterns with a complementary DNA 
microarray is described in Schenaef al. (1995) Science 270:467. DeRisiefa/. (1997) 
Science. 270:680*86; DeRisiefa/. (1999) CurrQpin fWn. li(i):76-9; and Iyer era/. 
(1999) Scjeose 283:83-7. for example, explore gene expression on a genomic scale 
Wodicka et al. (1997) !MJm§cj L 1511-15 perform genome wide expression 
monitoring in S. cerevisiae. 
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Summary of the Invention 

dassZr' 5 " Pr ° Vided SCreeni " 9 " UCteiC 3Cid ™K a..ow 
das rfcat™ of target sequences: by their hybridan to a probe solution of suftabte 
labeled nucleic acids, and by physica. attributes associated wrth that probe The 

ri sa T ab : ,ed ^^ 

of drfferent nucleic acids. The source sam P ,e is donated prior to .abeling based on 
a phys,ca< attribute, e.g. association wKh a membrane bound nbosome. association 
wrth muffipte polysomes, association with specific molecules, e.g. proteins; subce.iu.ar 
ocafization.efc. Each traction of interest is labeted. The probe or probes is hybridized 
to an array, and the labe«s present on the probe or probes is detected. Nucieic acids 
present on the array are scored for the presence of the labete. Based on this 
.nformafon. target nucieic acids present on the array are characterized as to their 
correspondence with a desired physical attribute. 

in one embodiment of the invention, the methods provide rapid differentiation 
beKveen sequences that code for solubte intracellufcr proteins, and proteins that are 
ether secreted or associated with celluiar membrane structures such as the piasma 
membrane. Probe fractionation for these sequences relies the physica. association of 
membrane bound polysomes with mRNA encoding secreted or membrane bound 
proteins. 
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Brief Description of the Drawings 
Figure 1 shows a schematic of polysome isoiation and DNA microarray 



40 25 Description of the Specific Embodiments 

Methods are provided for characterizing target nucleic acids that are present on 
ananay. Target sequences are identified based on their hybridization to a probe The 
probes are .abated in order to provide information about a physica. attribute, e.g. .ength 
of mRNA, association with ribosomes. sub-ce.iu.ar .ocalization. etc. The probes are 
obtamed from a source sample that is fractionated based on such a physica. attribute 
The fractions are men differentially labeled. The probes hybridize to an array, and the 
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,o ; - • — — — . -i^^:"zrrr 

e0 ni,.«« , rf 9181 many dete «abte labels useful in 

conjugating nucleic acids are available and Mn k» , 

dh/ ent H» m «. w USed t0 provide combinatorial 

diversity In the subject methods. 

11» source m,** toa ma/ ^ ^ or 

^M*^,, Analysis of the hybridized array for relative 
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Source Sample 

Trreaorraeareplep^esfte,,^,,,,,,., In a oreferred 

r^rir r^-* -"A^^^ 

indirectly labeled, or amplified prior to labeling 
ttafVacoo^fo,,^^, ^^^^ 
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amount T^' ^ S ° Urce " ^ 3 ^ Where a 

amount of genome or cDNA sequence information is avai.abte. A number of 

oma„ 1Sms wrth complete known genome sequences, e.g. Aauifex aeoli J 
Arcnaeojoous a-*. fl0TO , a burgdorferf; Eschen . chja ^ 

MetHanococcus Jannascnu; Myco^ genitallum . 

Saccnaromyces cerews/ae; Synechocys,* PCC6803; and onanisms with substantia, 
sequence and mapping information known, e.g. Arabioopsis thatiana; Caenomat.itis 
elegans; Dmsoph „a ^anogaster; Homo sapiens; Leistomania major; Mas muscutos- 
Oryza sativa; Zea mays, etc. 



25 15 



Fractionation 

The source sample is fractionated according to a physical property. Where the 
source sample is mRNA, the physical property may provide information about the 
encoded polypeptide. For example, secreted and membrane bound mRNA species 
are typ.ca.ly associated with membrane bound polysomes. Amatively the 
20 polysomes may be fractionated by precipitation with anybodies. e.g. pan-specific 
ant.bod.es that recognize a peptide motif or domain of interest. SubceNufcr 
locateation. association with specific proteins, nuciear transport, and transanal 
regular are all properties that can be measured by physical fractionation of mRNAs 
Furthemrcre. by fractionating total mRNA populations on the basis of size using 
25 denaturing geWectrophoresfe. foHowed by fcbeiing effractions representing different 
see classes and hybridization to microarrays. the size of the mRNA represented by 
each unknown gene represented in a microarray can be determined. 

In one embodiment of the invention, cytoplasmic and membrane bound 
polysomes are separated. This can be accomplished via various subcellular 
30 fractionation protocols, including ones that are detergent or densrty gradient 
centnfugatjon based. A description of the density gradient based isolation processes 
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^,^? n ^T 7,M * En2vm ' ,, ,52:241 - 247 

"■Ihememtare^oundpoljsoniaKraaioii. . 
be performed n content*, win, the SMcM!lr taaionafi „ _ mlu " 

~ * - * - - ■« - ruxtr 

»^*»"»»d*by,hen,RNA An^unopr^ta^^to 
"* Fo, exampte. see and ^ .u^jj^ 

irr - ( ' 972, 

pho*h*enr«; and8le(ka . r^^,^ %^ 

ZiTT ^ technique p.™* „ 

**" ""oVpeptMe, encoded by genes represented „ „ amy 

-"^thes^p^^. n»*.»™* --1 J 11 J! 

itr^^' " -* """" ma * - «» «< 

of (hat mRNA, where a greater number „ ntosomas aw piMenl 

^»«.pr w * J g,ccond»on.of W ere s ,. Any suitable method may be used to 
'rectionate the polysomes, such as sucrose gradients and the like 

T" te fra *" ated acco,din9 to ten * , " m *. - 

length of the mRNAs that they represent. 
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Amplification 

5 stand , . !, ^ ^ "*»"*">» •» *»" * compte™*™ DNA 

th* subject DNA sequences. „ „, ^ 

Alternatively, mRNA may be amplified by the methods described^! 
^ US. Paten, no. m Van ^ „ „ 8yWh J^ ; 

PO^asep^^. Ah^seRNA.^^^^^ J 
Polvmeres. capable of binding to g. p rom «er reg™. 

Labeling Fractions 

Aa *^^^b.ineh,dM i „, uai ^ acMollreMi 
conned to the ^ ate ^ in 

SUM. in** taro*.™. fluoresce* feahtocyan.,, (FITC) 

JiTftrr Mta, °*^^ w. c^^x^od^ 

(5-FAM) or N,N,N',N'-tetramethyf-6- catboxyrhodamine (TAMRA); etc 
M n»^l^^ ! ^ fc,m ^'" alWISarel ™»"»«»«Fo, 

^^-*-»-»--»"-----. O^uonxh^sare 
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The labe, ma, afeo te an ^m, syMim , „ tee me 

— IM, Partner k ^ to a daeaab* ^ 

s Mem,^. te pM „, nuc|eMMes ^ . fc ^ J -» 

■ncvporatt ft, lata, lnt0 ft, ^.^^ ed. so „ to 

A guMdy « easily debate signal k prefeTCa , ,„„ , uorescen , 
«* P-obe » p^. Ofte, sdtab* ^ hdude teavy ^ 

» ~» 'abefc. enz^e Wed ^toacSve *be,s. a* ^ L,' 

LTl' . te *"* *««<™<«° — M* can be 

smultanaoualy "»»sijred in one hybrtdizabm 

^n,^^a„ !OThtew ^^^^^ 

*. can be a, simp,, a, ,ab* s Mct , ^ , ^ Juofoc|ii ^ 

«cn .acdon can be ccnpa™, to ft, same cemmon refeence by 
^d^Mividdalnacta.^.,*^,,^ ' 

»^ " fereTOn " Xt,re to » « ""■"I". ft. taacence rati, o, 
«e lTO »^a,^, mV e^ ( s«ne Ria , (a ,. (1sl97) ^ for 

^*>^-^P,obe) ,n Ws way.a M ^ . 

« — Size, can be rciated „ a sense of fracbans, and 

identified in an array. 

•"«"»• case,, , may „ MranttgMtls te ^ ^ ^ ^ 
ITTT* ™° " «*•"»• «•». each taaon „ „_ 

^r B »ad*^ bknuoro ^ K ^ tort ^ |t ^ ^ 
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quantitative properties such as m p Wi • ktoswsw 

s rxtr 

attached to particles Hioh d P ° r may te 

P 3 ™^- High density microarrays of pc>lvnucteotM<« ,~ o 
and are commercially available. P^^Wes are known m the art 

The sequence of polynucleotides on the arrav will 
1 0 4 , 1 0 s or mom Hi«» M », sequences, and may comprise 

"«« no. 5,,x«4 (Rnung „ „. s . ^ , ^ ' " U 8 

amatn, pa ^ m . By use „ ^ ^ ^ ^ 

sr~^'=="*-'=.:==: 
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Cty, CaW. AJternatively, amplification reactions are u.pH -m 

„, , Iens8l haw 6ee „ ws . 

™ r t* aMchM ,o a * — — ~ * 

automatM in any of a number of ways. 
or.oon,bina t .„ofanyofme S e.orany«he, eTOC <lln ( , S y stem . 
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Hybridization 

■j-^' 6 n 5^' r ^'2ation conditions between probe and target is selected such that the 

^cseouancaein™^ m WcM> Mlmalceilsa „, « ' 

»M»m 256.1 18-122). An alkylammonton buffer tends bwffinimfee 
^„hy bi d MMr3teaM!taK%tetoGCewieni " 
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5 

Detection of Signal 

analyrsis. For any particular airavei^n* «. * * ^sequent 

probe is corneal t„ *1 ° * flUOreSCent one 

:—========= 
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a large number of fluorescence data points may be collected The 

position. 9 MrTespondi «* acid attached to that 
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Corresponding full-length Sequence 

==r :=— ~= 

Wet.dBWCMdSp^^p,., .cm^h^c. NY. Tile cDNA can 
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leaothcDrWth* ^^"^"-"""^wnceframtaM 

TV (Wll) bckg over the pdyA tail of eukaryotic mRNA producing first standsymtiMte 
™^a»"*c^compo 8 iS<»«c a nb e u SMB , fo , K ^ plei)TOjuce 

* maj? '° " e ,, ™ te ' Sto0d *" "* k "°> <° •» Particular 
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scope of the ores-nt in™,-- ^ V ' not mtend ed to limit the 

«s used herem the angular forms "a", "and" and -w i„ . ^ . 

- MM* o, sue c* w ^rence^tr T *° ^ 

methodologies that are described in ,h. T ' C ° nStrUCt8, and 

Experimental 
Large-Scale Identification of Secreted and M«,h„ 

d and Membrane-associated Gene Products 
using DNA Microarrays 

^^ am *^tionofthe P rotc*dde^ 

isolate rER-bound polysomes from Jurkat cells, a human 
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leukemic T cell En and . ^ms ms35s - 

— *. ^jzzzzzzr e ' - (,s95> 

mRNA samples from »ast _ ! P J "*« 081,5 "* 

•» **» «perto« coined 5^™^' ^ — *>< 

— .MM ^ the ScWya, p^l " -* -» *™» 

■WWi ttat deserts a. «*J , , " ? """" 
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Uncertain 
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Uncertem. ESTs are novel cONA sequent whfch^ 1. searehes and were listed as 

. secreiea, or ER/Golgi-resident proteins. These reoiiite a « * 

«fc«w»-bo,ino polysomes «», purtS «i ^ 
»«• « «>» set) coo* fer 
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Table 2 



toe: 200 
genes 



* genes 



129 



20 



162 



reticulum. Golgi. vacuolar, lipid partdw ^ s ,r«~^? ' Vaactes of system. endooS 

[^ ne > •% Me^,^^^^ extracellular. ££££ 

Membrane-associated" categories'^ " 9 P 6 " 58 "^ of the sum of the Tritf and 

In conclusion, we have found that isolation of mRNAmoio , u 

«ca. .merest Some of the proteins identified in this way are litefv to 
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Materials and Methods: 
Jurkat mRMA FrartinnsKon- 

^n^lafen: Grow up ^ ^ 

a :^ e **" f ° r 10 "* « 2800 ■» (n After first round of centrifugal 
- W*. Repeatasnecessa*. Wash cells 2x with PBS supplemented with 50 ull 
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sucrose rK 150 M onto the gradient Balance 
tubes by adding 1.3 M sucrose TK m a , , centnfuge 
m sucrose TK 150 M as needed • Centrifuge for 15 hours at 90 Mr* 
Harvestgradients by puncturing bottom of centrifuge tubes wrth an iZ 

^o„ 8 at 2 60nm to determine presence of nJLS^Z**""" 
« mRNA wi,. be present in me ioad zone while rnemZt, ^ *" 

-* - - at the interface 

Alternately ft,™-*-™^ ™ 13M sucrose M«Ps- 
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b*r, » m « (PH ,4,, ,„ mM potesskm 

ml cornea » US ' nS " P ' e<00 ' e< ' ^ and » • 15 

n^TTl ? " ma ""' 9 * tx 8 mh to ». 

£££ wTr W ' Sa ™ °™ sta * « ~^ - 

*oxyd»,at. ,o a fral a 02% a 

SETT - **'" 1 
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WHAT IS CLAIMED IS: PCT/US99/I S3SS 

1 A method of screening an polynucleotide arrav th« 
fractionatinaasn,.^,., . oirae a " a y- *e method comprising: 

20 



-21- 



WO 00/03037 

PCT/US99/15355 

9. The method according to 8. wherein said physical property is size of the 
polysome-mRNA complex. 

5 , OH.?,. ^ ^ a<XOrdin9 t0 C ' aim 6 ' Wherein said P"**- P^erty is 
5 length of the mRNA species. 

^ aCCOnJin9 t0 C ' aim 4 ' Wh6rein S8W ,abeli "9 «"■>*■. 
amplification of said fractionated mRNA. 



10 



12 The method according to Claim 11, wherein said labeling comprises 
incorporation of label during amplification. 



13. The method according to Claim 4. wherein said polynucleotide array 
comprises cDNA sequences. 



15 



14. The method according to Claim 4. wherein said polynucleotide, array 
comprises cloned or amplified segments of genomic DMA. 

15. The method according to Claim 1, wherein said polynucleotide array 
20 compnses a planar array of at least about 10* different sequences/cm*. 

16. A method of screening an polynucleotide anray for the presence of 
PoW-nuc.eo Me sequences corresponding to secreted or membrane-bound 
polypeptides, the method comprising: 

25 ^ rtton *"9 a*™™ sample o^^ 

with membrane-bound ribosomes; 

labeling each of said fractions with a different fluorochrome or predefined ratio 
of two fluorochromes. such that each fraction is spertra.ry disfngushabte from every 
other fraction, to provide probes; 



30 



hybridizing a polynucleotide array comprising polynucleotide sequences with 
said probes; 
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quantrtating said spectrally distinguishable labels at ^ « 
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INTERNATIONAL SEARCH REPORT 



CLA SSinCATION OP SUBJECT M*tt»d~ 
IK(S) :Cl2Q|/ 68 au *' E CT_MATTER_ 
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